Distributed tracing support #5078

CodeBlanch · 2024-12-03T21:51:07Z

This PR adds a new pipeline which will listen to DiagnosticSourceEventSource in order to receive Activity instances (aka spans) from a target process.

The goal is to use this in dotnet-monitor to export distributed traces (along with logs & metrics) using OpenTelemetry Protocol (OTLP).

/cc @noahfalk @samsp-msft @rajkumar-rangaraj @wiktork

src/Microsoft.Diagnostics.Monitoring.EventPipe/Configuration/ActivitySourceConfiguration.cs

wiktork · 2024-12-09T18:42:55Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/Configuration/ActivitySourceConfiguration.cs

+            _ActivitySourceNames = activitySourceNames?.ToArray() ?? Array.Empty<string>();
+
+            if (_SamplingRatio < 1D)
+            {


I don't think we should be doing something this heavyweight and async in the ctor. If we want to validate this only works in version 9, we can fail the pipeline later or have the validation done upfront in a helper async method.

I moved it to a spot that felt more natural. Check it out, LMK.

src/Microsoft.Diagnostics.Monitoring.EventPipe/Configuration/ActivitySourceConfiguration.cs

wiktork · 2024-12-10T18:57:12Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/Traces/ActivityData.cs

+
+namespace Microsoft.Diagnostics.Monitoring.EventPipe
+{
+    internal readonly struct ActivityData


This is a very large struct object. Perhaps a class or record is more suitable? I have not looked at usage yet.

It is a super big struct! But that doesn't concern me. It is handled correctly, passed by reference (out or in in our cases). What this code is trying to do is not create unneeded GC pressure in the monitoring app. It doesn't have to do that, but that was my thinking 😄

wiktork · 2024-12-10T19:08:21Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/Traces/TraceEventExtensions.cs

+    internal static partial class TraceEventExtensions
+    {
+        [ThreadStatic]
+        private static KeyValuePair<string, object?>[]? s_TagStorage;


Can this be string/string?

It could be. Today everything pumped through DiagnosticSourceEventSource is in fact a string. But I went this direction because that is sort of an issue. When it comes to OTel distributed tracing, there are semantic conventions. Something like http.response.status_code should be an int to be compliant with the spec. I went with object so it could be fixed in the future if we add some typing info to the events or perhaps some conversion in the pipeline 🤷

wiktork · 2024-12-10T19:15:57Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/Traces/TraceEventExtensions.cs

+                    status,
+                    statusDescription);
+
+                payload.Tags = new(s_TagStorage, 0, tagCount);


If I am understanding correctly, the tags are cumulative? So payload n contains the tags from 0 to n? Perhaps it would be easier to just associate the tags with the event and then aggregate them as needed. The other consideration here is multiple sessions. If you start reading distributed tracing, fail and then start again the tag list will never reset.

There's no aggregation of tags. Each span/Activity will just have whatever tags were added to it. What is going on here is I have some [ThreadStatic] storage for those tags. They are read off the EventSource and added to that storage. Then we hand out a ReadOnlySpan to that storage. So the consumer can get them, but can't move it off the stack. Same kind of idea as the big ActivityData struct. The goal here is to avoid having to allocate a list or array to store the tags for each span/Activity we receive.

That sounds reasonable. I guess my only concern would be that this is unbounded but in practice I can't imagine that there are so many different tags in metrics that this would begin to be an issue.

wiktork · 2024-12-10T19:17:21Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/Traces/TracesPipeline.cs

+
+namespace Microsoft.Diagnostics.Monitoring.EventPipe
+{
+    internal class TracesPipeline : EventSourcePipeline<TracesPipelineSettings>


I think DistributedTracePipeline would work here.

I renamed it "DistributedTracesPipeline". Plural "Traces" because it felt better and we had plural in "Logs" pipeline. But happy to drop the "s" if you feel strongly about it.

src/Microsoft.Diagnostics.Monitoring.EventPipe/Traces/TraceEventExtensions.cs

wiktork · 2024-12-10T19:30:12Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/Traces/TraceEventExtensions.cs

+
+                if (!s_Sources.TryGetValue(sourceName, out source))
+                {
+                    source = new(sourceName, sourceVersion);


So what happens if there are two events that have the same source name but different versions?

We'll just report the version of the first one we saw. It is kind of an edge case thing. I doubt multiple sources with different versions really happens in practice (it could) but even if it did I don't think there would be big impact. We could make the key a tuple if you are concerned. Or we could do it later if anyone raises an issue?

We can great a bug/issue to track for now.

samsp-msft · 2024-12-12T22:48:29Z

I am pretty sure @noahfalk is OOF now for the holidays. @tarekgh may be able to review, or @tommcdon may have somebody else who can review?

tommcdon · 2024-12-12T23:39:41Z

I am pretty sure @noahfalk is OOF now for the holidays. @tarekgh may be able to review, or @tommcdon may have somebody else who can review?

@wiktor can review the change and if the change is ready, we will merge into main. Then @noahfalk can optionally perform a post-checkin review in January.

noahfalk

A few suggestions/comments inline but for the most part if the dotnet-monitor crew is happy then I'm happy.

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/DistributedTracesPipeline.cs

src/Microsoft.Diagnostics.Monitoring.EventPipe/EventSourcePipeline.cs

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/TraceEventExtensions.cs

src/Microsoft.Diagnostics.Monitoring.EventPipe/EventSourcePipeline.cs

wiktork · 2025-01-16T00:15:01Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/Configuration/ActivitySourceConfiguration.cs

+                filterAndPayloadSpecs.AppendLine($"[AS]{activitySource}/Stop{sampler}:-TraceId;SpanId;ParentSpanId;ActivityTraceFlags;TraceStateString;Kind;DisplayName;StartTimeTicks=StartTimeUtc.Ticks;DurationTicks=Duration.Ticks;Status;StatusDescription;Tags=TagObjects.*Enumerate;ActivitySourceVersion=Source.Version");
+            }
+
+            // Note: Microsoft-Diagnostics-DiagnosticSource only supports a


We solve this problem btw with a shared string across our configurations. Since we are not planning on operating triggers and OTLP at the same time right now, it's not a concern but if we want to do all of these, we'll have to aggregate the otlp configuration with the other ones. Let's save that for the future.

wiktork · 2025-01-16T00:28:46Z

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/DistributedTracesPipeline.cs

+            eventSource.Dynamic.All += traceEvent => {
+                try
+                {
+                    if (traceEvent.TryGetActivityPayload(out ActivityPayload activity))


Couldn't we move the validation code here and get away with just one session? We are not validating prior to running anyway.

I tried it. Doesn't work. When DS<9 if you try to give it the new sampling spec it just fails silently. There's no way to detect something didn't work.

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/TraceEventExtensions.cs

…pipeline

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/DistributedTracesPipeline.cs

wiktork · 2025-02-26T17:30:41Z

/azp run

azure-pipelines · 2025-02-26T17:30:55Z

Azure Pipelines successfully started running 1 pipeline(s).

Distributed tracing support.

32ce9dc

CodeBlanch requested a review from a team as a code owner December 3, 2024 21:51

wiktork reviewed Dec 9, 2024

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/Configuration/ActivitySourceConfiguration.cs Outdated Show resolved Hide resolved

wiktork reviewed Dec 9, 2024

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/Configuration/ActivitySourceConfiguration.cs Show resolved Hide resolved

wiktork reviewed Dec 10, 2024

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/Traces/TraceEventExtensions.cs Outdated Show resolved Hide resolved

wiktork reviewed Dec 10, 2024

View reviewed changes

CodeBlanch added 2 commits December 11, 2024 14:30

Code review.

0808ce8

Code review.

118fb2e

CodeBlanch added 2 commits December 18, 2024 11:22

Code review.

cd64d6a

Code review.

928a8f3

noahfalk approved these changes Jan 7, 2025

View reviewed changes

pharring reviewed Jan 8, 2025

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/TraceEventExtensions.cs Outdated Show resolved Hide resolved

pharring reviewed Jan 8, 2025

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/TraceEventExtensions.cs Outdated Show resolved Hide resolved

pharring reviewed Jan 8, 2025

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/EventSourcePipeline.cs Outdated Show resolved Hide resolved

CodeBlanch added 2 commits January 8, 2025 09:57

Code review.

773a14f

Fixes.

7ca33e4

CodeBlanch commented Jan 8, 2025

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/EventSourcePipeline.cs Outdated Show resolved Hide resolved

Tweak.

a213533

wiktork reviewed Jan 16, 2025

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/TraceEventExtensions.cs Show resolved Hide resolved

CodeBlanch added 2 commits February 24, 2025 19:03

Merge branch 'main' into distributed-traces-pipeline

4cc0f22

Merge remote-tracking branch 'upstream/main' into distributed-traces-…

c6a3f1b

…pipeline

CodeBlanch added 4 commits February 25, 2025 11:10

Code review.

ff3e2dd

Revert changes.

e1c5392

Code review.

65439dd

Code review.

13f6b81

wiktork reviewed Feb 25, 2025

View reviewed changes

src/Microsoft.Diagnostics.Monitoring.EventPipe/DistributedTraces/DistributedTracesPipeline.cs Outdated Show resolved Hide resolved

Code review.

33b9295

wiktork approved these changes Feb 26, 2025

View reviewed changes

noahfalk enabled auto-merge (squash) February 26, 2025 01:25

noahfalk merged commit acccd11 into dotnet:main Feb 26, 2025
20 checks passed

CodeBlanch deleted the distributed-traces-pipeline branch February 26, 2025 18:39

hoyosjs mentioned this pull request Mar 10, 2025

Create merge commit to prevent drift #5327

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Distributed tracing support #5078

Distributed tracing support #5078

CodeBlanch commented Dec 3, 2024

wiktork Dec 9, 2024

CodeBlanch Dec 12, 2024

wiktork Dec 10, 2024

CodeBlanch Dec 18, 2024

wiktork Dec 10, 2024

CodeBlanch Dec 13, 2024

wiktork Dec 10, 2024

CodeBlanch Dec 18, 2024 •

edited

Loading

wiktork Jan 16, 2025

wiktork Dec 10, 2024

CodeBlanch Dec 12, 2024

wiktork Dec 10, 2024

CodeBlanch Dec 18, 2024

wiktork Jan 16, 2025

samsp-msft commented Dec 12, 2024

tommcdon commented Dec 12, 2024

noahfalk left a comment

wiktork Jan 16, 2025

wiktork Jan 16, 2025

CodeBlanch Feb 25, 2025

wiktork commented Feb 26, 2025

azure-pipelines bot commented Feb 26, 2025

Distributed tracing support #5078

Distributed tracing support #5078

Conversation

CodeBlanch commented Dec 3, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

CodeBlanch Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

samsp-msft commented Dec 12, 2024

tommcdon commented Dec 12, 2024

noahfalk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wiktork commented Feb 26, 2025

azure-pipelines bot commented Feb 26, 2025

CodeBlanch Dec 18, 2024 •

edited

Loading